sfla based gene selection approach for improving cancer classification accuracy
نویسندگان
چکیده
in this paper, we propose a new gene selection algorithm based on shuffled frog leaping algorithm that is called sfla-fs. the proposed algorithm is used for improving cancer classification accuracy. most of the biological datasets such as cancer datasets have a large number of genes and few samples. however, most of these genes are not usable in some tasks for example in cancer classification. therefore, selection of the appropriate genes is important in bioinformatics and machine learning. the proposed method combines the advantage of wrapper and filter methods for gene subset selection. sfla-fs consists of two phases. in the first phase a filter method is used for gene ranking from high dimensional microarray data and in the second phase, sfla is applied to gene selection. the performance of sfla-fs evaluated for cancer classification using seven standard microarray cancer datasets. experimental results are compared with those of obtained from several existing well-known gene selection algorithm. the experimental results show that sfla-fs has a remarkable ability to generate reduced size of genes while yielding significant classification accuracy in cancer classification.
منابع مشابه
SFLA Based Gene Selection Approach for Improving Cancer Classification Accuracy
In this paper, we propose a new gene selection algorithm based on Shuffled Frog Leaping Algorithm that is called SFLA-FS. The proposed algorithm is used for improving cancer classification accuracy. Most of the biological datasets such as cancer datasets have a large number of genes and few samples. However, most of these genes are not usable in some tasks for example in cancer classification....
متن کاملSFLA Based Gene Selection Approach for Improving Cancer Classification Accuracy
In this paper, we propose a new gene selection algorithm based on Shuffled Frog Leaping Algorithm that is called SFLA-FS. The proposed algorithm is used for improving cancer classification accuracy. Most of the biological datasets such as cancer datasets have a large number of genes and few samples. However, most of these genes are not usable in some tasks for example in cancer classification. ...
متن کاملImproving Cancer Classification Accuracy Using Gene Pairs
Recent studies suggest that the deregulation of pathways, rather than individual genes, may be critical in triggering carcinogenesis. The pathway deregulation is often caused by the simultaneous deregulation of more than one gene in the pathway. This suggests that robust gene pair combinations may exploit the underlying bio-molecular reactions that are relevant to the pathway deregulation and t...
متن کاملA Novel Scheme for Improving Accuracy of KNN Classification Algorithm Based on the New Weighting Technique and Stepwise Feature Selection
K nearest neighbor algorithm is one of the most frequently used techniques in data mining for its integrity and performance. Though the KNN algorithm is highly effective in many cases, it has some essential deficiencies, which affects the classification accuracy of the algorithm. First, the effectiveness of the algorithm is affected by redundant and irrelevant features. Furthermore, this algori...
متن کاملClassification and Biomarker Genes Selection for Cancer Gene Expression Data Using Random Forest
Background & objective: Microarray and next generation sequencing (NGS) data are the important sources to find helpful molecular patterns. Also, the great number of gene expression data increases the challenge of how to identify the biomarkers associated with cancer. The random forest (RF) is used to effectively analyze the problems of large-p and smal...
متن کاملImproving Classification Accuracy Using Gene Ontology Information
Classification problems, e.g., gene function prediction problem, are very important in bioinformatics. Previous work mainly focuses on the improvement of classification techniques used. With the emergence of Gene Ontology (GO), extra knowledge about the gene products can be extracted from GO. Such kind of knowledge reveals the relationship of the gene products and is helpful for solving the cla...
متن کاملمنابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
amirkabir international journal of modeling, identification, simulation & controlناشر: amirkabir university of technology
ISSN 2008-6067
دوره 47
شماره 1 2015
میزبانی شده توسط پلتفرم ابری doprax.com
copyright © 2015-2023